SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



<i) APPLICANT: Zancope-Oliveia, Rosely M. 
et al . , 

5 (ii) TITLE OF INVENTION: .Nucleic Acids of the M Antigen 

Gene of Histoplasma Capsulatum, Isolated and 
Recombinant ly- Produced 

Antigens, Vaccines and Antibodies, Method 

(iii) NUMBER OF SEQUENCES: 13 

10 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fitch, Even, Tabin & Flannery 

(B) STREET: 135 South LaSalle Street, Suite 900 

(C) CITY: Chicago 

(D) STATE: IL 
15 ( E ) COUNTRY: USA 

(F) ZIP: 60603-4277 

^3 (v) COMPUTER READABLE FORM: 

m ( A > MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 
N- 20 (C) OPERATING SYSTEM: Windows 

w g (D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
m (B) FILING DATE: 

- |s 25 (C) CLASSIFICATION: 

a ivii) PRIOR APPLICATION DATA: 

;2 (A) APPLICATION NUMBER: 

W (B) FILING DATE: 

C3 

W 



(viii) ATTORNEY /AGENT INFORMATION: 
30 (A) NAME: Kaba, Richard A 

(B) REGISTRATION NUMBER: 30,562 

(C) REFERENCE /DOCKET NUMBER: 6314/62527 

(ix) TELECOMMUNICATION INFORMATION: 
■ (A) TELEPHONE: 312-372-7842 

35 (B) TELEFAX: 312-372-7848 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3862 base pairs ' 

40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 

45 (A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: var. capsulatum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

A^nSJSP GGCTC CGATA ACTTTGCTTT ATCCAAGGGT CTCGGCGAAT GCCAGGTGCC 60 
c n ^CGATCTAT ATTTTGAAGT TTATCACCTC AATGGCTTCA CCCCATGACG CACCTTTTAT 120 
50 TTTTATTTTC ATTCATCTTC TCTGTGGCAA ACATGCAGGT ATGCGAGCTC TCGaSctSs 180 
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4 



10 



15 



20 



25 



30 



■ 35 

s. 

CP 

P 40 



45 



50 



55 



60 



GGTGTGGCCC 
TTTTGGACCT 
TCCCCATTTG 
TTGGTCCCTA 
ATTGATTATT 
TCAGGGGGTA 
AGCCTGTCCC 
CCATGACACT 
ACAGTCGGTG 
TGGAAATAGA 
TGATCATGAG 
ATTTTACGCA 
TCACATCCTA 
AG C AG AC AC C 
CTGCTCGCGA 
AGCATTATAT 
ACCTAGGCTG 
TTCAGGACGC 
AAATTCCCCA 
GCTCATTGCA 
ATGTTGATGG 
TGGTCAAGTT 
AGGCTCTTGG 
CTGGAAGGTA 
TGTTTCTCTG 
CTTTATATAC 
CAAGTTTGAT 
CACCCCAATC 
GCAGATCATG 
GAAGCGTATA 
ATTTCACGGA 
ATCGCCATGG 
ATAACAACAA 
TTGACCCAAT 
CCCTCTAAAC 
CAACCGGACC 
AGTGCGCGAG 
CTCACTCACG 
CGTGCGGAGT 
CCTCGCCCGC 
CTACCACAAC 
CGGGCTGAAA 
CCGGGCCGCG 
TGATCCCCAA 
CGTGATCGTC 
GCTCAGGATT 
TAGCAATGAA 
GGACCAGCCC 
GGACGGATTG 
AGGTTTGGGG 
CTGTTTTTCC 
GCCGGTGTCA 
GTGTAACTAT 
TCTCACATTC 
GAGACAAATA 
TCAGATATTA 
TGAATATTTA 
TGTTTTAGCT 
TGACTATCTG 
ATAGATAAAA 
TACATATCAT 
ACAACACCTT 



TTGATGCATA 
CCTCCCGCCA 
GGTTCCCTAT 
CCCTCTCCAA 
TGTCTTCTTC 
AAAATGCGGT 
TACATGTCAG 
CTCTCCGACC 
CTAACAACGG 
GGCCCAACTC 
AGGGTATGTA 
GGTTCCTGAG 
TAATAACTGG 
AGTATTCGTG 
TATCCACGGA 
CGTGGTAGTC 
ACTACTCGGC 
TATTCAATTC 
GGCTGCAACT 
TGCCCTCTTC 
GTGGGGCGTC 
TCGCTGGAAG 
CGGAAAGAAT 
CCCTGAGTGG 
CTCTGTCGGT 
GTTTTACTCA 
TTCGATCTAT 
GGAAAAATGG 
GTTGGTCCAC 
TCTAAATATA 
TGACCCTTTG 
AGGTCCCAAC 
TCGCGACGGT 
CGATTTGTAT 
ACGGCCGCAT 
CATAACAGAG 
CTCAGCCCGA 
GTCTTCGAGA 
GAAACCGTGC 
CGCGTCGCGC 
AAGGCAACCG 
ATCGCCCTCC 
TTTAACAGCG 
CGCGGCGTGA 
GTCGGCGGCC 
ATTACGGATG 
GCCCTTCGTG 
GGTGTGTATA 
ACGGCATATC 
CGCAAATATG 
ATCTTTGGTT 
AGCTTCANGA 
AATAATTTAT 
AATCTATATT 
ATGATAGATT 
TCTATCATGT 
TATGCTGTAA 
TAAAATCTCA 
ATAAAAATGT 
CCTGAACGAT 
TCCCATATGA 
CAAAAAGGAT 



TGGTTTATTT 
TTCTTTCCAA 
AGGCCACTGC 
CACTATCTGC 
AGCATCTTTT 
CGCTCAAGCT 
GGGAGATGCC 
CTACGGACCA 
ACGTGGGTGG 
TACTTGAGGA 
GATACAAAAT 
CGCGCCGTCC 
TCGAATATCA 
CGGTTTTCTA 
TTTGCGACCC 
ATACTCATAA 
AATGTAGATA 
CCTGATTTGA 
GCACATGATA 
TGGGCAATGT 
CATACCTTCC 
ACCCTCCAAG 
CCCGACTTCC 
GAGGTAAGAT 
TGCTCTTTTC 
TATAGCTGGG 
TAGATCCCAC 
TCTTGAACCG 
CCCCTATATA 
TTTCCACAGT 
CTTCAGGGCC 
TTCGAGCAAC 
GCTGGTAAGC 
AGAGTATTAA 
ATACACCCAA 
GATTCTTCAC 
GCTTCAACGA 
AGCAATTCCT 
GTAAGAACGT 
TAGCTATCGG 
TCCCCATCGG 
TGACAAGAGA 
CCAACAACAA 
ACATGACCTA 
TGCTCACGAG 
CATACGCGTA 
ACGTCCTTAT 
TTTCCAACGA 
GGTTCTTGAA 
GGTTTACTAC 
GAGGTAATAT 
GGCCTAATTA 
AATAACTAAT 
TGATCCTTGT 
AACAAATAAT 
TGTAATGATA 
ATGTAGGGAA 
CTAAGGTCGG 
CTGTATTTCC 
TTAGCCCCTG 
CCAAAAACTA 
CC 



ATAGCCGCCC 
ACTTCGTGCG 
GTGCTCCACT 
ATATGTAATA 
TGTCTCGAGC 
TATACTCGCC 
TAGCGGTCAG 
GTTTCTTAGC 
TCCCATCGAG 
TTTTATCTTC 
ATGTGACCGT 
ATGCTCGAGG 
CAGCCGCATC 
CAGTCGCTGG 
GTCTGTATAC 
CAGCACAACA 
TCGTCGGAAA 
TTCACGCTGT 
CGGCATGGGA 
CAGGACATGG 
GACTTGTCAC 
GAAGAGCGGG 
ATCGACAAGA 
ATGATTCCCC 
GTCTTTTTCT 
CTTTCAATTG 
CAAAATCATC 
AAACCCAAAA 
TTTGGAATAT 
TCCAACCAGG 
GCTTGTACTC 
TGCCGATCAA 
TACTTCTCAC 
CATCCCCGTC 
CTCAATGAGC 
CGCACCTGGG 
CGTCTGGTCC 
CGTCAACGCC 
CATCATCCAG 
CGTCGAACCC 
CACCTTCGGC 
CGACGGTAGC 
AGTAGATATC 
TTCCGGCGCC 
CGCCTCAACG 
TGGAAAGCCC 
GGCCGCTGGT 
TGTGAGTGAG 
TCGGTTCCCG 
CCCCCCCCCC 
TGCAGATATC 
ATTTGAAGAG 
AACTTATAAT 
CCTTTGTAGC 
TGCACACCCA 
CCTCAAAAAT 
GAGCGTACCA 
TCGTGTCTAT 
GCTTCACGAC 
TTGGGGGAAA 
AAATAGATAT 



GGAAGCCCTG 
TCCGTTTCCC 
CAAGAAGGGT 
TATATCGATA 
AAGCTTACTC 
TCGGCGGGTG 
AAAGGCCCCC 
AAGTTTTACA 
GACCAACACA 
CGCCAGAAGA 
GTTGCAAATC 
AGCTGGTGCC 
CTTCTTGAAC 
TAGCAGAGGC 
CGATGAAGGC 
AATATGAATA 
CAACGTTCCA 
CAAGCCGCAA 
TTTCCTCAGC 
AATCCCTCGC 
CGACGAGGGC 
CCTGGTATGG 
CCTCTGGGAT 
CAAATCATTA 
ATATCTTCAA 
GTGAATGAAG 
CCAGAAGAAC 
AGTTATTTTG 
GAATACATGT 
TCATGTAGTT 
CTACCTTGAC 
CAGACCCCGC 
CTACCATGTC 
TGCACAGGAC 
AACGGATTCC 
CGTATGGTAA 
CAACCGCGTC 
ATGCGCTTCG 
CTGAACCGCG 
CCATCCCCGG 
ACGAATCTCC 
TTCACGATCG 
GTCCTAGTGG 
GACGGCTCGA 
CAATACCCAA 
GTTGGCGCCG 
GGGGATGCGT 
GCCTACGTTA 
TTGGATAGAA 
CCCTTTTTTT 
AGTAAATTGC 
GAGGTTGAAG 
TAATGTCTAT 
TGTTTAAATA 
ATAGGCCTTC 
GCCACAAGCT 
TCCAAATAAC 
TTGAAATGGC 
GCATGTTATG 
TAGGGGTTAG 
ATATATATAT 



GCCTGTTAAA 
ATTTCCCCCC 
CCCAGTCAAT 
TCTAACTGCC 
CACGTTCAAT 
TTGTTTCTGC 
TCGATCGCCG 
TTGACGATGA 
GCCTGAAGGC 
TTCAACACTT 
CGCTAATTCA 
CATGGCGTAT 
GCGGCAGGAA 
AGTGTTGACT 
AATTTTGGTA 
CAAACCCAGG 
GTCTTCTTCA 
CCAGACAGTG 
CAGCAGCCCA 
TCAATGCGTC 
AACTCGACCT 
GAAGAGGCAC 
GCCATTGAAT 
GTTCTGACAG 
CTAAGACTGA 
CAGATCAATC 
TTGTTCCTTT 
CCGAAACTGA 
ATAGCTAGAT 
CGCGGAATCG 
ACTCAATTGA 
ATCCCATTCC 
AACTTCCATC 
AAATGTTCAT 
CACAACAAGC 
ATGGACCACT 
TCTTCTACAA 
AAAACTCCCA 
TCGACAACGA 
ACCCAACCTT 
TGCGGCTCGA 
CGGAGCAGCT 
GCTCATCGCT 
TCTTCGATGC 
GAGGTCGCCC 
TCGGTGACGG 
CGAATGGGCT 
GAAGSgTCTT 
GdT,TG<STATG 
TTTf CCTTTT 
GTTTACGAAA 
TGAAATCTTG 
TGTAATTTCC 
TAAGCCAAGA 
CCTCACGATA 
TGCCTGATAT 
CAGAAAAACA 
TGCGGCAAGC 
ACTTTCGAAT 
GGGGGCGAGC 
ATATATATAT 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
i620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3862 



65 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Pro Ser Gly Gin Lys Gly Pro Leu Asp Arg Arg His Asp Thr Leu 

1 5 10 15 

Ser Asp Pro Thr Asp Gin Phe Leu Ser Lys Phe Tyr He Asp Asp Glu 

5 20 25 30 

Gin Ser Val Leu Thr Thr Asp Val Gly Gly Pro He Glu Asp Gin His 

35 40 45 

Ser Leu Lys Ala Gly Asn Arg Gly Pro Thr Leu Leu Glu Asp Phe lie 
50 55 60 

10 Phe Arg Gin Lys He Gin His Phe Asp His Glu Arg Val Pro Glu Arq 

6 ;> 70 75 80 

Ala Val His Ala Arg Gly Ala Gly Ala His Gly Val Phe Thr Ser Tyr 

85 90 95 

Asn Asn Trp Ser Asn He Thr Ala Ala Ser Phe Leu Asn Ala Ala Gly 

15 100 105 110 

Lys Gin Thr Pro Val Phe Val Arg Phe Ser Thr Val Ala Gly Ser Arq 

„ , 115 120 125 

Gly Ser Val Asp Ser Ala Arg Asp He His Gly Phe Ala Thr Arg Leu 

11° 135 140 

Tyr Thr Asp Glu Gly Asn Phe Asp He Val Gly Asn Asn Val Pro Val 
145 150 155 160 

Phe Phe He Gin Asp Ala He Gin Phe Pro Asp Leu He His Aia Val 
165 170 175 

» oiz Lys Pro Gln Pro As P Ser Giu Ile Pro Gln A la Ala Thr Ala His Asp 

L % 25 180 185 190 

*P Tnr Aia Trp Asp Phe Leu Ser Gln Gln Pro Ser Ser Leu His Ala Leu 

m 195 200 205 

i'l phe Tr P Ala Met Ser Gly His Gly He Pro Arg Ser Met Arg His Val 

^„ 210 215 220 

*S 30 Asp Gly Trp Gly Val His Thr Phe Arg Leu Val Thr Asp Glu Gly Asn 
U 225 230 235 240 

! „ Ser Tnr Leu Vai Lys Phe Arg Trp Lys Thr Leu Gln Gly Arg Ala Gly 

V3 24 5 250 2 55 

10 o C Leu Val Trp Glu Glu Ala Gln Ala Leu Gly Gly Lys Asn Pro Asp Phe 

Jb 260 265 270 

s Hls Arg Gln Asp Leu Trp Asp Ala Ile Glu Ser Gly Arg Tyr Pro Glu 

P 275 280 285 

fft Tr P G i u Leu Gly Phe Gln Leu Val Asn Glu Ala Asp Gln Ser Lys Phe 

V£ . 290 295 300 

Q- 40 Asp Phe Asp Leu Leu Asp Pro Thr Lys He Ile Pro Glu Glu Leu Val 
IU 305 310 315 320 

m Pro Phe Thr Pro Ile Gly Lys Met Val Leu Asn Arg Asn Pro Lys Ser 

H 325 330 335 

M, Tyr Phe Ala Glu Thr Glu Gln Ile Met Phe Gln Pro Gly His Val Val 

45 340 345 350 

Arg Gly Ile Asp Phe Thr Asp Asp Pro Leu Leu Gln Gly Arg Leu Tyr 

355 360 365 

Ser Leu Asp Thr Gln Leu Asn Arg His Gly Gly Pro Asn Phe Glu 

370 375 380 

Gln Leu Pro Ile Asn Arg Pro Arg Ile Pro Phe His Asn Asn Asn Arq 
385 390 395 400 

Asp Gly Ala Gly Gln Met Phe Ile Pro Leu Asn Thr Ala Ala Tyr Thr 

405 410 415 

Pro Asn Ser Met Ser Asn Gly Phe Pro Gln Gln Ala Asn Arq Thr His 
55 420 425 430 

Asn Arg Gly Phe Phe Thr Ala Pro Gly Arg Met Val Asn Gly Pro Leu 

435 440 445 

Val Arg Glu Leu Ser Pro Ser Phe Asn Asp Val Trp Ser Gln Pro Arq 

450 455 460 

Leu Phe Tyr Asn Ser Leu Thr Val Phe Glu Lys Gln Phe Leu Val Asn 
465 470 475 480 

Ala Met Arg Phe Glu Asn Ser His Val Arg Ser Glu Thr Val Arq Lys 

485 490 495 

Asn Val Ile Ile Gln Leu Asn Arg Val Asp Asn Asp Leu Ala Arq Ara 
65 500 505 510 

Val Ala Leu Ala Ile Gly Val Glu Pro Pro Ser Pro Asp Pro Thr Phe 

. 525 520 525 

Tyr His Asn Lys Ala Thr Val Pro Ile Gly Thr Phe Gly Thr Asn Leu 

530 535 540 

Leu Arg Leu Asp Gly Leu Lys Ile Ala Leu Leu Thr Arg Asp Asp Gly 



50 



60 



70 
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545 550 555 560 

Ser Phe Thr He Ala Glu Gin Leu Arg Ala Ala Phe Asn Ser Ala Asn 

565 570 575 

Asn Lys Val Asp lie Val Leu Val Gly Ser Ser Leu Asp Pro Gin Arg 

580 585 590 

Gly Val Asn Met Thr Tyr Ser Gly Ala Asp Gly Ser lie Phe Asp Ala 

595 600 605 

Val lie Val Val Gly Gly Leu Leu Thr Ser Ala Ser Thr Gin Tyr Pro 

610 615 620 

Arg Gly Arg Pro Leu Arg lie He Thr Asp Ala Tyr Ala Tyr Gly Lys 
b " 630 635 640 

Pro Val Gly Ala Val Gly Asp Gly Ser Asn Glu Ala Leu Arg Asp Val 

645 650 655 

Leu Met Ala Ala Gly Gly Asp Ala Ser Asn Gly Leu Asp Gin Pro Gly 

660 665 670 

Val Tyr lie Ser Asn Asp Val Ser Glu Ala Tyr Val Arg Ser Val Leu 

675 680 685 

Asp. Gly Leu Thr Ala Tyr Arg Phe Leu Asn Arg Phe Pro Leu Asp Arg 

on c , 0 695 700 

2 0 Ser Leu Val 

705. 

(2) INFORMATION FOR SEQ ID NO: 3: 

M (i) SEQUENCE CHARACTERISTICS: 

*5 „ (A) LENGTH : 8 amino acids 

" (B) TYPE: amino acid 

fp (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: None 

.c 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

=U 30 Ser Asp Pro Thr Asp Gin Phe Leu 

in 1 5 

L (2) INFORMATION FOR SEQ ID NO: 4: 

f| (i) SEQUENCE CHARACTERISTICS: 

Il. c (A) LENGTH : 15 amino acids • 

Q 35 (B) TYPE: amino acid 

LJ <C) STRANDEDNESS: single 

™ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asp Phe He Phe Arg Gin Lys He Gin His Phe Asp His Glu Ara 
1 5 10 15 y 

(2) INFORMATION FOR SEQ ID NO:5: 



(i) SEQUENCE CHARACTERISTICS: 
- _ (A) LENGTH: 9 amino acids 
4 5 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
50 Thr Leu Gin Gly Arg Ala Gly Leu Val 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
c _ (A) LENGTH: 16 amino acids 

55 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ala Gin Ala Leu Gly Gly Lys Asn Pro Asp Phe His Arg Gin Asp Leu 
b 10 15 

5 (2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

i f\ (C) STRANDEDNESS : single 

10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ser Gly Arg Tyr Pro Glu 
1 5 

15 (2) INFORMATION FOR SEQ ID NO: 8: 

Q (i) SEQUENCE CHARACTERISTICS* 

V3 (A) LENGTH: 10 amino acids 

ffi (B) TYPE: amino acid 

-\ « n (C) STRANDEDNESS: single 

M 20 (D) TOPOLOGY: linear 

_E MOLECULE TYPE: None 

H? (xi) SEQUENCE DESCRIPTION: SEQ ID NO*8- 

tfi p ? e Asp phe Asp Leu Leu As p Fro Tnr L v s 

15 10 

Q 25 (2) INFORMATION FOR SEQ ID NO: 9: 

?S (i) SEQUENCE CHARACTERISTICS* 

Q- (A) LENGTH : 14 amino acids 

1. 1 (B) TYPE: amino acid 

< c > STRANDEDNESS : single 
Cnf oU (D) TOPOLOGY: linear 

fa (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

lie He Pro Glu Glu Leu Val Pro Phe Thr Pro He Gly Lvs 
1 5 10 

35 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 
An (C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AARAAYCCVG AYTTY 

15 

45 (2) INFORMATION FOR SEQ ID NO:ll: 

(i) SEQUENCE CHARACTERISTICS' 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
cn (C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 
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(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
TTNCCDATNG TRAA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
CGGAATCCTC CGACCCTACG GA 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
ACCAAGCTTC TATCCAACGG GAACCGA 
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, 09/674195 
Rec'dPCT/rD 26OCT200Q 



1 



SEQUENCE LISTING 



<110> THE GOVERNMENT OF THE UNITED STATES OF AMERICA, AS REPRI 

THE 

DEPARTMENT OF HEALTH AND HUMAN SERVICES CENTERS FOR DISI 

CONTROL AND 

PREVENTION 

ZANCOPE-OLIVEIRA, ROSALY M. 

LOTT, TIMOTHY J. 

MAYER, LEONARD W. 

REISS, ERROL 

DEEPE , JR., GEORGE S. 

<12 0> NUCLEIC ACIDS OF THE M ANTIGEN GENE OF 

HISTOPLASMA CAPSULATUM, ISOLATED AND RECOMBINANTLY- PRODUCED 
ANTIGENS, VACCINES AND ANTIBODIES, METHODS AND KITS FOR 
DETECTING HISTOPLASMOSIS 



<130> 65798 / US 

<140> NYA 

<141> 1999-04-27 

<150> U.S. 60/083,676 
<151> 1998-04-30 

<150> PCT/US99/09151 
<151> 1999-04-27 

<160> 13 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 3863 

<212> DNA 

<213> Histoplasma capsulatum 



ggatcctgct ggctccgata actttgcttt atccaagggt ctcggcgaat gccaggtgcc 
atcgatctat attttgaagt ttatcacctc aatggcttca ccccatgacg caccttttat 
ttttattttc attcatcttc tctgtggcaa acatgcaggt atgcgagctc tggaccctgg 
ggtgtggccc ttgatgcata tggtttattt atagccgccc ggaagccctg gcctgttaaa 
ttttggacct cctcccgcca ttctttccaa acttcgtgcg tccgtttccc atttcccccc 
tccccatttg ggttccctat aggccactgc gtgctccact caagaagggt cccagtcaat 
ttggtcccta ccctctccaa cactatctgc atatgtaata tatatcgata tctaactgcc 
attgattatt tgtcttcttc agcatctttt tgtctcgagc aagcttactc cacgttcaat 
tcagggggta aaaatgcggt cgctcaagct tatactcgcc tcggcgggtg ttgtttctgc 
agcctgtccc tacatgtcag gggagatgcc tagcggtcag aaaggccccc tcgatcgccg 
ccatgacact ctctccgacc ctacggacca gtttcttagc aagttttaca ttgacgatga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



<400> 1 



acagtcggtg ctaacaacgg acgtgggtgg tcccatcgag gaccaacaca gcctgaaggc 720 

tggaaataga ggcccaactc tacttgagga ttttatcttc cgccagaaga ttcaacactt 780 

tgatcatgag agggtatgta gatacaaaat atgtgaccgt gttgcaaatc cgctaattca 840 

attttacgca ggttcctgag cgcgccgtcc atgctcgagg agctggtgcc catggcgtat 900 

tcacatccta taataactgg tcgaatatca cagccgcatc cttcttgaac gcggcaggaa 960 

agcagacacc agtattcgtg cggttttcta cagtcgctgg tagcagaggc agtgttgact 1020 

ctgctcgcga tatccacgga tttgcgaccc gtctgtatac cgatgaaggc aattttggta 1080 

agcattatat cgtggtagtc atactcataa cagcacaaca aatatgaata caaacccagg 114 0 

acctaggctg actactcggc aatgtagata tcgtcggaaa caacgttcca gtcttcttca 1200 

ttcaggacgc tattcaattc cctgatttga ttcacgctgt caagccgcaa ccagacagtg 1260 

aaattcccca ggctgcaact gcacatgata cggcatggga tttcctcagc cagcagccca 1320 

gctcattgca tgccctcttc tgggcaatgt caggacatgg aatccctcgc tcaatgcgtc 1380 

atgttgatgg gtggggcgtc cataccttcc gacttgtcac cgacgagggc aactcgacct 1440 

tggtcaagtt tcgctggaag accctccaag gaagagcggg cctggtatgg gaagaggcac 1500 

aggctcttgg cggaaagaat cccgacttcc atcgacaaga cctctgggat gccattgaat 1560 

ctggaaggta ccctgagtgg gaggtaagat atgattcccc caaatcatta gttctgacag 162 0 

tgtttctctg ctctgtcggt tgctcttttc gtctttttct atatcttcaa ctaagactga 1680 

ctttatatac gttttactca tatagctggg ctttcaattg gtgaatgaag cagatcaatc 1740 

caagtttgat ttcgatctat tagatcccac caaaatcatc ccagaagaac ttgttccttt 18 00 

caccccaatc ggaaaaatgg tcttgaaccg aaacccaaaa agttattttg ccgaaactga 1860 

gcagatcatg gttggtccac cccctatata tttggaatat gaatacatgt atagctagat 1920 

gaagcgtata tctaaatata tttccacagt tccaaccagg tcatgtagtt cgcggaatcg 1980 

atttcacgga tgaccctttg cttcagggcc gcttgtactc ctaccttgac actcaattga 2040 

atcgccatgg aggtcccaac ttcgagcaac tgccgatcaa cagaccccgc atcccattcc 2100 

ataacaacaa tcgcgacggt gctggtaagc tacttctcac ctaccatgtc aacttccatc 2160 

ttgacccaat cgatttgtat agagtattaa catccccgtc tgcacaggac aaatgttcat 222 0 

ccctctaaac acggccgcat atacacccaa ctcaatgagc aacggattcc cacaacaagc 2280 

caaccggacc cataacagag gattcttcac cgcacctggg cgtatggtaa atggaccact 2340 

agtgcgcgag ctcagcccga gcttcaacga cgtctggtcc caaccgcgtc tcttctacaa 24 00 

ctcactcacg gtcttcgaga agcaattcct cgtcaacgcc atgcgcttcg aaaactccca 2460 

cgtgcggagt gaaaccgtgc gtaagaacgt catcatccag ctgaaccgcg tcgacaacga 252 0 

cctcgcccgc cgcgtcgcgc tagctatcgg cgtcgaaccc ccatccccgg acccaacctt 2580 

ctaccacaac aaggcaaccg tccccatcgg caccttcggc acgaatctcc tgcggctcga 2640 

cgggctgaaa atcgccctcc tgacaagaga cgacggtagc ttcacgatcg cggagcagct 2700 

ccgggccgcg tttaacagcg ccaacaacaa agtagatatc gtcctagtgg gctcatcgct 2 760 

tgatccccaa cgcggcgtga acatgaccta ttccggcgcc gacggctcga tcttcgatgc 2820 

cgtgatcgtc gtcggcggcc tgctcacgag cgcctcaacg caatacccaa gaggtcgccc 2880 

gctcaggatt attacggatg catacgcgta tggaaagccc gttggcgccg tcggtgacgg 2 940 

tagcaatgaa gcccttcgtg acgtccttat ggccgctggt ggggatgcgt cgaatgggct 3000 

ggaccagccc ggtgtgtata tttccaacga tgtgagtgag gcctacgtta gaagtgtctt 3060 

ggacggattg acggcatatc ggttcttgaa tcggttcccg ttggatagaa gctaaggtat 3120 

gaggtttggg gcgcaaatat gggtttacta cccccccccc cccctttttt tttttccttt 3180 

tctgtttttc catctttggt tgaggtaata ttgcagatat cagtaaattg cgtttacgaa 3240 

agccggtgtc aagcttcang aggcctaatt aatttgaaga ggaggttgaa gtgaaatctt 3300 

ggtgtaacta taataattta taataactaa taacttataa ttaatgtcta ttgtaatttc 3360 

ctctcacatt caatctatat ttgatccttg tcctttgtag ctgtttaaat ataagccaag 3420 

agagacaaat aatgatagat taacaaataa ttgcacaccc aataggcctt ccctcacgat 3480 

atcagatatt atctatcatg ttgtaatgat acctcaaaaa tgccacaagc ttgcctgata 354 0 

ttgaatattt atatgctgta aatgtaggga agagcgtacc atccaaataa ccagaaaaac 3600 

atgttttagc ttaaaatctc actaaggtcg gtcgtgtcta tttgaaatgg ctgcggcaag 3660 

ctgactatct gataaaaatg tctgtatttc cgcttcacga cgcatgttat gactttcgaa 3720 

tatagataaa acctgaacga tttagcccct gttgggggaa ataggggtta ggggggcgag 3 780 



J 




3 



ctacatatca ttcccatatg accaaaaact aaaatagata tatatatata tatatatata 
tacaacacct tcaaaaagga tec 



3840 
3863 



<210> 2 
<211> 705 
<212> PRT 

<213> Histoplasma capsulatum 
<400> 2 

Met Pro Ser Gly Gin Lys Gly Pro Leu Asp Arg Arg His Asp Thr Leu 

15 10 15 

Ser Asp Pro Thr Asp Gin Phe Leu Ser Lys Phe Tyr lie Asp Asp Glu 

20 25 30 

Gin Ser Val Leu Thr Thr Asp Val Gly Gly Pro lie Glu Asp Gin His 

35 40 45 

Ser Leu Lys Ala Gly Asn Arg Gly Pro Thr Leu Leu Glu Asp Phe lie 

50 55 60 

Phe Arg Gin Lys lie Gin His Phe Asp His Glu Arg Val Pro Glu Arg 
65 70 75 80 

Ala Val His Ala Arg Gly Ala Gly Ala His Gly Val Phe Thr Ser Tyr 

85 90 95 

Asn Asn Trp Ser Asn lie Thr Ala Ala Ser Phe Leu Asn Ala Ala Gly 

100 105 110 

Lys Gin Thr Pro Val Phe Val Arg Phe Ser Thr Val Ala Gly Ser Arg 

115 120 125 

Gly Ser Val Asp Ser Ala Arg Asp lie His Gly Phe Ala Thr Arg Leu 

130 135 140 

Tyr Thr Asp Glu Gly Asn Phe Asp lie Val Gly Asn Asn Val Pro Val 
145 150 155 160 

Phe Phe lie Gin Asp Ala lie Gin Phe Pro Asp Leu lie His Ala Val 

165 170 175 

Lys Pro Gin Pro Asp Ser Glu lie Pro Gin Ala Ala Thr Ala His Asp 

180 185 190 

Thr Ala Trp Asp Phe Leu Ser Gin Gin Pro Ser Ser Leu His Ala Leu 

195 200 205 

Phe Trp Ala Met Ser Gly His Gly lie Pro Arg Ser Met Arg His Val 

210 215 220 

Asp Gly Trp Gly Val His Thr Phe Arg Leu Val Thr Asp Glu Gly Asn 
225 230 235 240 

Ser Thr Leu Val Lys Phe Arg Trp Lys Thr Leu Gin Gly Arg Ala Gly 

245 250 255 

Leu Val Trp Glu Glu Ala Gin Ala Leu Gly Gly Lys Asn Pro Asp Phe 

260 265 270 

His Arg Gin Asp Leu Trp Asp Ala lie Glu Ser Gly Arg Tyr Pro Glu 

275 280 285 

Trp Glu Leu Gly Phe Gin Leu Val Asn Glu Ala Asp Gin Ser Lys Phe 

290 295 300 

Asp Phe Asp Leu Leu Asp Pro Thr Lys lie lie Pro Glu Glu Leu Val 
305 310 315 320 

Pro Phe Thr Pro lie Gly Lys Met Val Leu Asn Arg Asn Pro Lys Ser 

325 330 335 

Tyr Phe Ala Glu Thr Glu Gin He Met Phe Gin Pro Gly His Val Val 



340 345 350 

Arg Gly He Asp Phe Thr Asp Asp Pro Leu Leu Gin Gly Arg Leu Tyr 

355 360 365 

Ser Tyr Leu Asp Thr Gin Leu Asn Arg His Gly Gly Pro Asn Phe Glu 

370 375 380 

Gin Leu Pro He Asn Arg Pro Arg He Pro Phe His Asn Asn Asn Arg 
385 390 395 400 

Asp Gly Ala Gly Gin Met Phe He Pro Leu Asn Thr Ala Ala Tyr Thr 

405 410 415 

Pro Asn Ser Met Ser Asn Gly Phe Pro Gin Gin Ala Asn Arg Thr His 

420 425 430 

Asn Arg Gly Phe Phe Thr Ala Pro Gly Arg Met Val Asn Gly Pro Leu 

435 440 445 

Val Arg Glu Leu Ser Pro Ser Phe Asn Asp Val Trp Ser Gin Pro Arg 

450 455 460 

Leu Phe Tyr Asn Ser Leu Thr Val Phe Glu Lys Gin Phe Leu Val Asn 
465 470 475 480 

Ala Met Arg Phe Glu Asn Ser His Val Arg Ser Glu Thr Val Arg Lys 

485 490 495 

Asn Val He He Gin Leu Asn Arg Val Asp Asn Asp Leu Ala Arg Arg 

500 505 510 

Val Ala Leu Ala He Gly Val Glu Pro Pro Ser Pro Asp Pro Thr Phe 

515 520 525 

Tyr His Asn Lys Ala Thr Val Pro He Gly Thr Phe Gly Thr Asn Leu 

530 535 540 

Leu Arg Leu Asp Gly Leu Lys He Ala Leu Leu Thr Arg Asp Asp Gly 
545 550 555 560 

Ser Phe Thr He Ala Glu Gin Leu Arg Ala Ala Phe Asn Ser Ala Asn 

565 570 575 

Asn Lys Val Asp lie Val Leu Val Gly Ser Ser Leu Asp Pro Gin Arg 

580 585 590 

Gly Val Asn Met Thr Tyr Ser Gly Ala Asp Gly Ser He Phe Asp Ala 

595 600 605 

Val He Val Val Gly Gly Leu Leu Thr Ser Ala Ser Thr Gin Tyr Pro 

610 615 620 

Arg Gly Arg Pro Leu Arg He He Thr Asp Ala Tyr Ala Tyr Gly Lys 
625 630 635 640 

Pro Val Gly Ala Val Gly Asp Gly Ser Asn Glu Ala Leu Arg Asp Val 

645 650 655 

Leu Met Ala Ala Gly Gly Asp Ala Ser Asn Gly Leu Asp Gin Pro Gly 

660 665 670 

Val Tyr He Ser Asn Asp Val Ser Glu Ala Tyr Val Arg Ser Val Leu 

675 680 685 

Asp Gly Leu Thr Ala Tyr Arg Phe Leu Asn Arg Phe Pro Leu Asp Arg 
690 695 700 

Ser 
705 



<210> 3 
<211> 8 
<212> PRT 

<213> Histoplasma capsulatum 



5 



<400> 3 

Ser Asp Pro Thr Asp Gin Phe Leu 
1 5 



<210> 4 
<211> 15 
<212> PRT 

<213> Histoplasma capsulatum 



<400> 4 

Asp Phe He Phe Arg Gin Lys He Gin His Phe Asp His Glu Arg 
1 5 io 15 

<210> 5 
<211> 9 
<212> PRT 

<213> Histoplasma capsulatum 



<400> 5 

Thr Leu Gin Gly Arg Ala Gly Leu Val 
1 5 



<210> 6 
<211> 16 
<212> PRT 

<213> Histoplasma capsulatum 
<400> 6 

a Gin Ala Leu Gly Gly Lys Asn Pro Asp Phe His Arg Gin Asp Leu 
5 io 15 

<210> 7 
<211> 6 
<212> PRT 

<213> Histoplasma capsulatum 



<400> 7 
Ser Gly Arg Tyr Pro Glu 
1 5 



<210> 8 
<211> 10 
<212> PRT 

<213> Histoplasma capsulatum 
<400> 8 

Phe Asp Phe Asp Leu Leu Asp Pro Thr Lys 
1 5 io 



<210> 9 
<211> 14 
<212> PRT 



6 



<213> Histoplasma capsulatum 

<400> 9 

He He Pro Glu Glu Leu Val Pro Phe 
1 5 

<210> 10 
<211> 15 
<212> DNA 
<213> Unknown 

<220> 

<223> Primer 

<400> 10 
aaraayccvg aytty 

<210> 11 

<211> 14 

<212> DNA 

<213> Unknown 

<220> 

<223> primer; n= a, t, g or c 

<400> 11 
ttnccdatng traa 

<210> 12 
<211> 22 
<212> DNA 
< 2 1 3 > Unknown 

<220> 

<223> Primer 

<400> 12 
cggaatcctc cgaccctacg ga 

<210> 13 
<211> 27 
<212> DNA 
<213> Unknown 

<220> 

<223> Primer 

<400> 13 
accaagcttc tatccaacgg gaaccga 



Thr Pro He Gly Lys 
10 



15 



14 



22 



